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Abstract 

Background: Salmonella enterica subsp. enterica serovars Typhimurium (S. Typhimurium) and Heidelberg (S. Heidelberg) 
are major causes of foodborne salmonellosis, accounting for a fifth of all annual salmonellosis cases in the United States. 
Rapid, efficient and accurate methods for identification are required for routine surveillance and to track specific strains 
during outbreaks. We used Pulsed-field Gel Electrophoresis (PFGE) and a recently developed molecular subtyping 
approach termed CRISPR-MVLST that exploits the hypervariable nature of virulence genes and Clustered Regularly 
Interspaced Short Palindromic Repeats (CRISPRs) to subtype clinical S. Typhimurium and S. Heidelberg isolates. 

Results: We analyzed a broad set of 1 75 5. Heidelberg and 5. Typhimurium isolates collected over a five-year period. 
We identified 21 Heidelberg Sequence Types (HSTs) and 37 Typhimurium STs (TSTs) that were represented by 27 and 
45 PFGE pulsotypes, respectively, and determined the discriminatory power of each method. 

Conclusions: For 5. Heidelberg, our data shows that combined typing by both CRISPR-MVLST and PFGE provided a 
discriminatory power of 0.921 3. Importantly, CRISPR-MVLST was able to separate common PFGE patterns such as 
JF6X01.0022 into distinct STs, thus providing significantly greater discriminatory power. Conversely, we show that 
subtyping by either CRISPR-MVLST or PFGE independently provides a sufficient discriminatory power (0.9345 and 
0.9456, respectively) for 5. Typhimurium. Additionally, using isolates from two S. Typhimurium outbreaks, we 
demonstrate that CRISPR-MVLST provides excellent epidemiologic concordance. 

Keywords: CRISPR subtyping, CRISPR-MVLST, Molecular subtyping, Salmonella Heidelberg, Sa/mone//a Typhimurium 



Background 

Non-typhoidal Salmonella are one of the leading causes of 
bacterial foodborne disease in the United States, account- 
ing for over a million human cases each year [1]. Salmon- 
ellosis symptoms include diarrhea, fever and abdominal 
cramps that occur 12 to 72 hours after infection. Annually, 
Salmonella is responsible for an estimated 20,000 hospital- 
izations and nearly 400 deaths in the United States, with a 
financial burden of approximately $3.3 - 4.4 billion [2,3]. 
Most infections are transmitted via ingestion of con- 
taminated food and, unlike trends with other bacterial 
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foodborne pathogens, the annual incidence rate of sal- 
monellosis has not significantly declined over the past 
decade. Since 2006, nearly a fifth of all salmonellosis cases 
in the United States were caused by Salmonella enterica 
subsp. enterica serovars Typhimurium (S. Typhimurium) 
and Heidelberg (S. Heidelberg) [4]. According to the Cen- 
ters for Disease Control and Prevention, there have already 
been two outbreaks in 2013 where S. Typhimurium and 
5. Heidelberg were responsible [5,6] . 

To limit and reduce the scope of a Salmonella outbreak, 
an efficient and robust surveillance system is vital. During 
epidemiological investigations Salmonella isolates are ser- 
otyped and concurrently subtyped to classify isolates to 
the strain level. An ideal subtyping method has a high dis- 
criminatory power (i.e. can separate all unrelated strains) 
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but is not so discriminatory that it inadvertently sepa- 
rates isolates that are part of the same outbreak (i.e. 
possesses high epidemiologic concordance). There are 
several molecular-based subtyping approaches that have 
been developed, including pulsed-field gel electrophor- 
esis (PFGE) [7], amplified fragment length polymorphism 
(AFLP) [8-10], multiple-locus variable-number tandem- 
repeat analysis (MLVA) [11-17], multiple amplification of 
prophage locus typing (MAPLT) [13,18] and, most re- 
cently, a multiplex DNA suspension array [19]. PFGE 
was adapted to Salmonella in the 1990s and generally 
provides a high discriminatory power for subtyping most 
Salmonella serovars, though it certainly does not provide 
equal sensitivity across all serovars [20]. Despite being 
labor-intensive and time-consuming, conventional sero- 
typing and concurrent PFGE fingerprinting is still con- 
sidered the gold standard for Salmonella subtyping and is 
widely used by public health surveOlance laboratories 
[21-23]. Although PFGE data are uploaded to PulseNet 
USA (http://www.cdc.gov/pulsenet), the national electronic 
network for food disease surveillance that is coordinated 
by the CDC, inter-laboratory comparisons of PFGE finger- 
prints can be ambiguous. 

There are several different PFGE patterns, or pulsotypes, 
though most often a limited number of common patterns 
are associated with the majority of isolates within a given 
serovar. Two recent S. Typhimurium and S. Heidelberg 
foodborne outbreaks in the United States involved con- 
taminated cantaloupe melons (5. Typhimurium, 2012; 228 
reported illnesses) [24] and broiled chicken livers (S. 
Heidelberg, 2011; 190 reported illnesses) [25]. In both 
cases, the individual Xbal PFGE patterns associated with 
each strain were fairly common: for S. Typhimurium, the 
associated PFGE pattern is typically seen in 10-15 cases 
per month [24] and for 5. Heidelberg, the pattern occurs 
even more frequently, 30-40 cases per month [25]. Conse- 
quently, identification of the outbreak strains was particu- 
larly difficult and to more accurately identify isolates that 
were part of the S. Typhimurium cantaloupe outbreak, 
these isolates were also analyzed by MVLA to define the 



outbreak strain. Additionally, another S. Heidelberg out- 
break in 2011, linked to ground turkey, involved isolates 
with two similar but distinctly different PFGE patterns, 
thus showing reduced epidemiologic concordance by this 
subtyping method [26]. This last example may indicate 
evolutionary relatedness between the two sets of isolates 
which, unlike some methods, PFGE cannot really provide. 

The recent outbreak cases described above highlight the 
need for additional subtyping approaches for Salmonella 
that can be used instead of, or as a complement to PFGE 
for routine disease surveillance and outbreak tracking. 
Clustered Regularly Interspaced Short Palindromic Re- 
peats (CRISPRs) are found in -50% of all bacterial species, 
including Salmonella [27]. CRISPR elements comprise 
several unique short sequences, called spacers, which are 
interspaced by conserved direct repeats. In some bacteria, 
homology between a spacer and a complementary tar- 
get nucleic acid results in degradation of the target by 
sequence-specific endonucleases, providing protection 
from exogenous bacteriophage or plasmid DNA [reviewed 
in 28]. Due to both acquisition and loss of these spacer 
elements, CRISPRs represent arguably the most rapidly 
evolving prokaryotic loci [29-31]. 

Sequence analysis of CRISPR loci has been used to sub- 
type clinical isolates of Salmonella [32-34], Escherichia coli 
[35,36], group A Streptococcus [37] and Campylobacter 
species [38]. Salmonella contains two of these non-coding 
loci, which are comprised of direct repeats of 29 nucleo- 
tides separated by spacers of 32 nucleotides (Figure 1). 
Generally, CRISPR polymorphisms between Salmonella 
strains are due to deletion or repetition of one or more 
spacers, termed 'spacer microevolution' [32-34,39,40]. An 
extensive investigation of 738 isolates, representing several 
different serovars, showed that polymorphisms within 
the CRISPR loci correlate highly with serovar, with iso- 
lates from individual serovars bearing distinct CRISPR 
patterns [32]. 

We recently developed a sequence-based subtyping 
assay (multi-virulence locus sequence typing; MVLST) 
for Salmonella that involves the sequencing of two 
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Figure 1 Salmonella CRISPR loci. Salmonella have two CRISPR loci, CRISPR1 and CRISPR2 comprised of direct repeats of 29 nucleotides 
(black diamonds) separated by spacers (empty rectangles). There is an A-T rich leader sequence upstream of each locus (shaded rectangle) and 
the CRISPR-05sociated genes (cos) are upstream of the CRISPR1 locus (grey boxed arrow). Primers used for amplification are shown in blue and 
red for CRISPR1 and CRISPR2, respectively. 
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virulence genes, fimHl ifimH) and sseL, in addition to 
CRISPR sequencing [33]. Preliminary studies showed 
that this approach, termed CRISPR-MVLST, provided 
better discrimination than either CRISPR or MVLST 
alone and, importantly, exhibited strong epidemiologic 
concordance among eight out of nine of the most com- 
mon illness-causing Salmonella enterica serovars [33], 
including both S. Heidelberg and 5. Typhimurium out- 
break strains. Subsequently, among a large number of 
clinical isolates of the highly clonal S. Enteritidis, a com- 
bination of CRISPR-MVLST and PFGE was required to 
provide a sufficient discriminatory power [34]. Among a 
large set of S. Newport clinical isolates, CRISPR-MVLST 
provides similar discrimination to PFGE [41]. 

To further determine the functionality of this new 
subtyping approach, we investigated the discriminatory 
power of both CRISPR-MVLST and PFGE among a lar- 
ger and unbiased collection of clinical S. Typhimurium 
and S. Heidelberg isolates that were collected over a five 
year period. We show here that a combination of both 
CRISPR-MVLST and PFGE is required to achieve an ap- 
propriate discriminatory power for 5. Heidelberg. For S. 
Typhimurium, both subtyping methods independently 
provide a discriminatory power >0.94. Importantly, as 
one of the first applications of CRISPR-MVLST to 
analyze isolates that were part of an outbreak, we were 
able to cluster two different 5. Typhimurium outbreak 
strains. 

Results 

Results of CRISPR-MVLST 

To more accurately determine the discriminatory power 
of CRISPR-MVLST and PFGE for S. Heidelberg and S. 
Typhimurium, we subtyped 89 and 86 isolates, respect- 
ively, that were obtained from the Pennsylvania Depart- 
ment of Health (Table 1). Among the 175 total isolates 
analyzed, we identified 29 CRISPR 1 alleles, 31 CRISPR2 
alleles, 6 fimH alleles and 7 sseL alleles (Table 2). Of these, 
we found 27, 30, 2 and 4 alleles, respectively, that were 
novel and not seen in our previous data sets [33]. In 
total, these alleles defined 58 novel sequence types among 
the two serovars (Tables 3 and 4). The overwhelming 
sequence-type diversity among both of these prevalent ser- 
ovars is provided by genetic variability in the CRISPR loci, 
rather than in either fimH or sseL (Figure 2). We found 
that 88/89 S. Heidelberg isolates had fimH allele 7 and in 
S. Typhimurium there were two predominant fimH alleles, 
allele 6 (52/86 isolates) and allele 8 (28/86 isolates). Simi- 
larly, in S. Heidelberg, 88/89 isolates bore sseL allele 19 
and in S. Typhimurium, 73/86 isolates had sseL allele 15. 
The polymorphisms between different sseL or fimH alleles 
arise from the presence of SNPs with the exception of 
allele 63 that has a single base insertion. No alleles for 
any of the four markers were shared among the two 



different serovars, consistent with previously published 
studies [32-34]. 

5. Heidelberg analysis and sequence type distribution 

CRISPR-MVLST analysis of 89 5. Heidelberg clinical iso- 
lates (representing 27 unique PFGE patterns) resulted in 
21 unique S. //eidelberg Sequence Types (HSTs), HST 7 - 
HST 27 (Table 3). In total, we identified 12 CRISPRl al- 
leles, 8 CRISPR2 alleles, 2 fimH alleles and 2 sseL alleles 
(Table 2). As shown in Figure 2b, most of the allelic diver- 
sity comes from the CRISPRl and CRISPR2 loci. All 12 
CRISPRl alleles and seven of the eight CRISPR2 alleles 
were new, compared to our previous studies [33]. We did 
not find any new fimH alleles in our dataset and only one 
of the two sseL alleles was new. The most frequent ST was 
HST7, occurring in 49/89 isolates (54%). 

Discriminatory power of CRISPR-MVLST and PFGE in 
5. Heidelberg isolates 

The discriminatory power of CRISPR-MVLST among 
the S. Heidelberg isolates was calculated to be 0.6931 
(Figure 3a). The discriminatory power provided by PFGE 
among the same isolates was 0.8149 (Figure 3b). Given 
these low values and insufficient discriminatory power (an 
ideal discriminatory power is >0.95) [42], we combined 
the two typing methods. This combination provided 44 
unique groups with a more satisfactory discriminatory 
power of 0.9213 (Figure 3c), suggesting a 92% confidence 
in ability to separate unrelated isolates. 

Separation of common 5. Heidelberg subtypes 

Among the S. Heidelberg isolates analyzed, the most 
frequent PFGE pulsotype was JF6X01.0022 (42%). We 
were able to further subtype isolates with JF6X01.0022 
pattern into 7 distinct HSTs - HST 7, 9, 12, 14, 19, 26 
and 27 (Figure 3d). Among JF6X01.0022 isolates, the 
two most common HSTs were HST7 (62%) and HST9 
(22%). JF6X01.0058 is also fairly common, occurring in 
8% of isolates studied. With these isolates, we were able 
to further subtype them into 3 distinct HSTs - HST 7, 
21 and 24, with HST21 being the most common (71%). 
Conversely, over half the isolates analyzed have HST 7 
(54%), but by PFGE analysis, these are represented by 
18 different PFGE patterns, the most frequent being 
JF6X01.0022 (48%). Collectively, this data highlights the 
strengths and weakness of each subtyping method. 

5. Typhimurium analysis and sequence type distribution 

CRISPR-MVLST analysis of 86 S. Typhimurium clinical 
isolates (representing 45 unique PFGE patterns) resulted in 
the identification of 37 unique and novel S. Typhimurium 
Sequence Types (TSTs), TST9 - TST41, and TST56 - 
TST58 (Table 4). This included 17 CRISPRl, 23 CRISPR2, 
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Table 1 List of 175 S. Heidelberg and S. Typhimurium isolates from the Pennsylvania Department of Health that were 
analyzed in this study 



Isolate 


Sequence type 


PFGE pattern 


PA region 


Isolation d 


S. Heidelberg 










06E00444 


HST7 


J F6X0 1.0022 


SE 


Mar-06 


06E00725 


HST7 


J F6X0 1.0022 


SE 


Jun-06 


06E01437 


HST7 


J F6X0 1.0022 


SE 


Aug-06 


07E00466 


HST7 


J F6X0 1.0022 


SE 


Apr-07 


07E00758 


HST7 


J F6X0 1.0022 


NC 


May-07 


07E01405 


HST7 


J F6X0 1.0022 


SE 


Aug-07 


07E01505 


HST7 


J F6X0 1.0022 


SE 


Aug-07 


08E00753 


HST7 


JF6X01.0022 


NE 


Jun-08 


08E01373 


HST7 


J F6X0 1.0022 


SE 


Aug-08 


09E00537 


HST7 


J F6X0 1.0022 


SE 


Mar-09 


09E00701 


HST7 


J F6X0 1.0022 


SE 


Mar-09 


09E00750 


HST7 


J F6X0 1.0022 


SE 


Apr-09 


09E00782 


HST7 


J F6X0 1.0022 


SE 


Apr-09 


09E01149 


HST7 


J F6X0 1.0022 


SE 


May-09 


09E01511 


HST7 


J F6X0 1.0022 


SE 


Jun-09 


M0901 9838001 A 


HST7 


J F6X0 1.0022 


SE 


Aug-09 


Ml 00031 50001 A 


HST7 


J F6X0 1.0022 


SE 


Jan-10 


M10014816001A 


HST7 


J F6X0 1.0022 


SE 


Jun-10 


Ml 00 15406001 A 


HST7 


J F6X0 1.0022 


SE 


Jul-10 


Ml 0022 189001 A 


HST7 


J F6X0 1.0022 


SE 


Sep-10 


M11012103001A 


HST7 


J F6X0 1.0022 


SW 


Apr-1 1 


M11017212001A 


HST7 


J F6X0 1.0022 


SE 


Jul-11 


M11021620001A 


HST7 


J F6X0 1.0022 


SW 


Aug-1 1 


06E00846 


HST7 


J F6X0 1.0032 


SW 


Jun-06 


08E00963 


HST7 


J F6X0 1.0033 


SW 


Jul-08 


08E01089 


HST7 


J F6X0 1.0033 


SE 


Jul-08 


07E01378 


HST7 


J F6X0 1.0034 


SW 


Jul-07 


08E00470 


HST7 


J F6X0 1.0034 


NE 


May-08 


08E00508 


HST7 


J F6X0 1.0034 


NE 


May-08 


Ml 0000626001 A 


HST7 


J F6X0 1.0034 


SW 


Dec-09 


07E00964 


HST7 


J F6X0 1.0042 


NW 


Jun-07 


Ml 1025202001 A 


HST7 


J F6X0 1.0042 


SC 


Oct-1 1 


M11027881001A 


HST7 


J F6X0 1.0042 


NE 


Nov-1 1 


07E01870 


HST7 


J F6X0 1.0045 


SC 


Sep-07 


M09021251001A 


HST7 


J F6X0 1.0051 


SE 


Sep-09 


09E00927 


HST7 


JF6X01.0058 


SE 


May-09 


08E00342 


HST7 


J F6X0 1.0080 


SE 


Mar-08 


M11018110001A 


HST7 


J F6X0 1.0087 


NW 


Jul-11 


06E00558 


HST7 


JF6X01.0122 


NW 




07E00680 


HST7 


JF6X01.0122 


SW 


May-07 


07E02336 


HST7 


JF6X01.0161 


SW 


Nov-07 


07E02139 


HST7 


JF6X01.0167 


SW 


Oct-07 
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Table 1 List of 175 S. Heidelberg and S. Typhimurium isolates from the Pennsylvania Department of Health that were 
analyzed in this study (Continued) 



M09033280001A 


HST7 


JF6X01 


.0221 


SE 


Dec-09 


Ml 0004098001 A 


HST7 


JF6X01 


.0246 


SE 


Feb-10 


08E01461 


HST7 


JF6X01 


.0324 


SE 


Aug-08 


09E00128 


HST7 


JF6X01 


.0324 


SE 


Jan-09 


M0901 5668001 A 


HST7 


JF6X01 


.0326 


SE 


Jul-09 


Ml 001 5955001 A 


HST7 


JF6X01 


.0581 


SW 


Jul-10 


06E01523 


HST8 


JF6X01 


.0051 


SE 


Sep-06 


08E00143 


HST9 


JF6X01 


.0022 


NE 


Feb-13 


08E01679 


HST9 


JF6X01 


.0022 


SC 


Sep-08 


06E01915 


HST9 


JF6X01 


.0022 


SC 


Oct-06 


07E00349 


HST9 


JF6X01 


.0022 


SW 


Feb-07 


07E02366 


HST9 


JF6X01 


.0022 


NE 


Dec-07 


09E01408 


HST9 


JF6X01 


.0022 


SW 


Jun-09 


Ml 0006052001 A 


HST9 


JF6X01 


.0022 


SW 


Mar-10 


Ml 0021 328001 A 


HST9 


JF6X01 


.0022 


SC 


Sep-10 


Ml 1000821 001 A 


HST9 


JF6X01 


.0041 


NW 


Jan-11 


06E00519 


HST9 


JF6X01 


.0052 


NE 


Apr-05 


07E00933 


HST 10 


JF6X01 


.0051 


SC 


Jun-07 


08E00107 


HST 11 


JF6X01 


.0085 


NE 


Jan-08 


09E00226 


HST 12 


JF6X01 


.0022 


SE 


Jan-09 


Ml 0020282001 A 


HST 13 


JF6X01 


.0034 


NC 


Sep-10 


07E02483 


HST 14 


JF6X01 


.0022 


SC 


Dec-07 


08E00103 


HST 14 


JF6X01 


.0022 


SE 


Jan-08 


07E00451 


HST 15 


JF6X01 


.0049 


SC 


Mar-07 


08E01904 


HST 15 


JF6X01 


.0049 


SW 


Sep-08 


08E01911 


HST 15 


JF6X01 


.0049 


SW 


Oct-08 


07E01400 


HST 15 


JF6X01 


.0270 


SE 


Jul-07 


Ml 0004892001 A 


HST 17 


JF6X01 


.0041 


SE 


Mar-10 


Ml 1005464001 A 


HST 17 


JF6X01 


.0041 


SW 


Feb-11 


Ml 1000267001 A 


HST 17 


JF6X01 


.0500 


NW 


Dec-10 


M09020244001A 


HST 18 


JF6X01 


.0321 


SW 


Aug-09 


M09022904001A 


HST 19 


JF6X01 


.0022 


NE 


Sep-09 


Ml 1020321 001 A 


HST 20 


JF6X01 


.0042 


SE 


Aug-1 1 


Ml 001 8092001 A 


HST 21 


JF6X01 


.0033 


SW 


Aug-10 


M11011342001A 


HST 21 


JF6X01 


.0058 


SW 


Apr-1 1 


M11013202001A 


HST 21 


JF6X01 


.0058 


SW 


May-1 1 


M11015845001A 


HST 21 


JF6X01 


.0058 


SW 


Jun-11 


Ml 101 5850001 A 


HST 21 


JF6X01 


.0058 


SW 


Jun-11 


M11023722001A 


HST 21 


JF6X01 


.0058 


SW 


Sep-11 


Ml 1005685001 A 


HST 21 


JF6X01 


.0582 


SW 


Feb-11 


M10002453001A 


HST 22 


JF6X01 


.0032 


SC 


Jan-10 


M0901 6444001 A 


HST 22 


JF6X01 


.0033 


NC 


Jul-09 


07E02184 


HST 23 


JF6X01 


.0042 


SE 


Oct-07 


07E01907 


HST 24 


JF6X01 


.0058 


SW 


Sep-07 
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Table 1 List of 175 S. Heidelberg and S. Typhimurium isolates from the Pennsylvania Department of Health that were 
analyzed in this study (Continued) 



06E00416 


HST25 


JF6X01.0172 


NC 


Mar-06 


06E00651 


HST26 


J F6X0 1.0022 


SE 


Jun-06 


06E01299 


HST27 


J F6X0 1.0022 


SE 


Aug-06 


S. Typhimurium 










07E00002 


TST9 


JPXX01.0177 




Dec-06 


07E02276 


TST9 


JPXX01.0177 




Nov-07 


08E02063 


TST9 


JPXX01.0177 




Oct-08 


09E00003 


TST9 


JPXX01.0177 




Dec-08 


M09023403001A 


TST9 


JPXX01.0177 




Sep-09 


07E01490 


TST 10 


J PXXO 1.0003 




Aug-07 


07Em769 


TST 10 


J PXXO 1.0003 




Sep-07 


07E02403 


TST 10 


J PXXO 1.0003 




Dec-07 


08E00363 


TST 10 


J PXXO 1.0003 




Apr-08 


09E00309 


TST 10 


JPXX01.0003 




Jan-09 


Ml 0005050001 A 


TST 10 


J PXXO 1.0003 




Feb-10 


Ml 001 01 38001 A 


TST 10 


J PXXO 1.0003 




Apr- 10 


Ml 00235 15001 A 


TST 10 


J PXXO 1.0003 




Oct- 10 


07E00173 


TST 10 


J PXXO 1.00 18 




Jan-07 


08E00006 


TST 10 


JPXX01.0018 




Dec-07 


M0901 7753001 A 


TST 10 


J PXXO 1.00 18 




Jul-09 


Ml 0003 149001 A 


TST 10 


J PXXO 1.00 18 




Jan-10 


Ml 0006054001 A 


TST 10 


J PXXO 1.0098 




Mar-10 


07E00658 


TST 10 


JPXX01.0256 




Apr-07 


08E00457 


TST 10 


JPXX01.1011 




Apr-08 


Ml 00 18865001 A 


TST 10 


JPXX01.2731 




Aug-10 


07E00234 


TST 11 


JPXX01.0442 




Feb-07 


Ml 000 1003001 A 


TST 11 


J PXXO 1.0442 




Jan-10 


07E00290 


TST 12 


J PXXO 1.0022 




Feb-07 


07E00436 


TST 12 


JPXX01.0146 




Mar-07 


M09028540001A 


TST 12 


J PXXO 1.0 146 




Oct-09 


Ml 00 12000001 A 


TST 12 


J PXXO 1.0 146 




May- 10 


M11018826001A 


TST 12 


J PXXO 1.0604 




Jul-11 


09E01310 


TST 12 


JPXX01.0925 




May-09 


08E02215 


TST 12 


JPXX01.1302 




Nov-08 


08E00255 


TST 13 


JPXXOl.OOOl 




Feb-08 


M11021986001A 


TST 13 


J PXXO 1.0081 




Aug-1 1 


09E00084 


TST 13 


JPXX01.0111 




Dec-08 


07E00868 


TST 13 


J PXXO 1.0206 




Jun-07 


07E00568 


TST 13 


J PXXO 1.0642 




Apr-07 


07E00364 


TST 13 


JPXX01.1212 




Jan-07 


07E01042 


TST 14 


JPXX01.1393 




Jun-07 


07E01180 


TST 15 


J PXXO 1.0003 




Jun-07 


08E01211 


TST 15 


J PXXO 1.0003 




Jul-08 


Ml 1004438001 A 


TST 15 


J PXXO 1.0003 




Jan-11 
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Table 1 List of 175 S. Heidelberg and S. Typhimurium isolates from the Pennsylvania Department of Health that were 
analyzed in this study (Continued) 



M11016520001A 


TST 15 


JPXXOl 


.0070 


Jun-11 


07E01365 


TST 16 


JPXX01 


.0928 


Jul-07 


08E00877 


TST 17 


JPXXOl 


.0006 


Jun-08 


08E01423 


TST 17 


JPXXOl 


.0006 


Aug-08 


07E02063 


TST 17 


JPXXOl 


.0146 


Oct-07 


M09025088001A 


TST 17 


JPXXOl 


.0146 


Oct-09 


Ml 1002975001 A 


TST 17 


JPXXOl 


.0146 


Jan-11 


08E01586 


TST 17 


JPXXOl 


.0416 


Sep-08 


07E02348 


TST 18 


JPXXOl 


.0018 


Nov-07 


08E00618 


TST 19 


JPXXOl 


.0146 


May-08 


M10000110001A 


TST 19 


JPXXOl 


.0146 


Jan-10 


Ml 001 0755001 A 


TST 19 


JPXXOl 


.0146 


May- 10 


M11025544001A 


TST 19 


JPXXOl 


.0146 


Sep-11 


08E00074 


TST 19 


JPXXOl 


.0557 


Jan-08 


Ml 101 1894001 A 


TST 19 


JPXXOl 


.2900 


Apr-1 1 


M0901 8928001 A 


TST 20 


JPXXOl 


.0001 


Aug-09 


08E00152 


TST 20 


JPXXOl 


.0014 


Feb-08 


09E00747 


TST 20 


JPXXOl 


.0014 


Apr-09 


Ml 102961 9001 A 


TST 20 


JPXXOl 


.0014 


Nov-1 1 


Ml 0026894001 A 


TST 20 


JPXXOl 


.0146 


Nov-10 


08E00998 


TST 21 


JPXXOl 


.0604 


Jul-08 


08E02429 


TST 22 


JPXXOl 


.1396 


Dec-08 


09E00422 


TST 23 


JPXXOl 


.1255 


Feb-09 


09E00632 


TST 24 


JPXXOl 


.1975 


Mar-09 


09E00904 


TST 25 


JPXXOl 


.2016 


Apr-09 


M09014919001A 


TST 26 


JPXXOl 


.0083 


Jun-09 


M0901 5997001 A 


TST 27 


JPXXOl 


.0416 


Jul-09 


M09020496001A 


TST 28 


JPXXOl 


.0146 


Aug-09 


M09021 700001 A 


TST 29 


JPXXOl 


.0552 


Sep-09 


M10014370001A 


TST 30 


JPXXOl 


.0333 


Jun-10 


Ml 00 15309001 A 


TST 31 


JPXXOl 


.0003 


Jun-10 


M10016817001A 


TST 32 


JPXXOl 


.0324 


Jul-10 


Ml 0025067001 A 


TST 33 


JPXXOl 


.0359 


Oct- 10 


Ml 0028492001 A 


TST 34 


JPXXOl 


.0060 


Dec-10 


M11001607001A 


TST 35 


JPXXOl 


.0359 


Jan-11 


M11009301001A 


TST 36 


JPXXOl 


.1678 


Mar-l 1 


M11012744001A 


TST 37 


JPXXOl 


.0013 


May-1 1 


Ml 1015184001 A 


TST 38 


JPXXOl 


.1833 


Jun-11 


M11022803001A 


TST 39 


JPXXOl 


.0146 


Sep-11 


Ml 0007760001 A 


TST 40 


JPXXOl 


.2488 


Apr-1 0 


Ml 1006620001 A 


TST 41 


JPXXOl 


.1314 


Feb-11 


Ml 1024498001 A 


TST 42 


JPXXOl 


.0351 


Oct-1 1 


09E01078 


TST 42 


JPXXOl 


.0781 


May-09 
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Table 1 List of 175 S. Heidelberg and S. Typhimurium isolates from the Pennsylvania Department of Health that were 
analyzed in this study (Continued) 

07E00784 TST56 JPXX01.0359 May-07 

08E00321 TST57 JPXX01.1301 Mar-08 

M09031352001A TST 58 JPXX01.0146 Nov-09 

The data are shown in order of Sequence Type (HST or TST) and further sorted by PFGE pattern. 



4 fimH and 5 sseL alleles (Table 2). Of these, the majority 
of CRISPRl alleles were new (15/17 alleles) and all 
CRISPR2 alleles were new (23/23), as compared to our 
previous studies [33]. As with 5. Heidelberg, the majority 
of unique sequence types were defined by polymorphisms 
in either or both of the CRISPR loci (Figure 2c). 

Discriminatory power of CRISPR-iViVLST and PFGE in S. 
Typhimurium isolates 

The discriminatory power of CRISPR-MVLST among 
the S. Typhimurium isolates was 0.9415 (Figure 4a). This 
means that there would be a 94% probability that two 
unrelated isolates could be separated using the CRISPR- 
MVLST scheme. Similarly, for PFGE, the discriminatory 
power among these isolates is 0.9486 (Figure 4b). These 
values suggest that either method can provide sufficient 
discrimination between outbreak and non-outbreak S. 
Typhimurium strains. 

Correlation between different TSTs and PFGE patterns 

We next wanted to investigate whether any correlation 
existed between TSTs and PFGE patterns. To accomplish 
this, we first determined the relationship among different 
TSTs. BURST analysis of all 37 TSTs generated four 
groups (Figure 5a). Of these. Groups 1-3 contain 6-15 
TSTs. Group 4 consists of only two TSTs and BURST was 
unable to assign a core TST. There was also a collection of 
five singletons that BURST did not assign to a group. For 
Groups 1-3, each group comprises a core TST surrounded 
by TSTs that differ from the core by one allele. The num- 
ber of rings in the group demonstrates the number of al- 
lele differences from the core. For example, in Group 1 
TSTs 9, 37, 32, 20, and 14 each differ by one allele at one 
locus from the core TST, TST 13. For group 3, TST 10 is 
the core TST and TSTs 15, 31, 36, 29, 23 and 16 each dif- 
fer from TST 10 at one locus. TST 34, in the outer ring 



Table 2 Number of alleles identified for each of the four 
CRISPR-MVLST markers 



Serovar 


fimH 


sseL 


CRISPRl 


CRISPR2 


S. Heidelberg 


0(2) 


1 (2) 


12 (12) 


7 (8) 


S. Typhimurium 


2 (4) 


3 (5) 


13 (IS) 


19 (19) 


Total 


2 (6) 


4(7) 


25 (27) 


26 (27) 



The total number of alleles for each locus is listed in parentheses with the 
number of alleles that are new in this study, as compared to Liu et al. [33], 
shown to the left. 



differs from the TSTs in the middle ring at one locus and 
from the core at two loci. 

To investigate whether there was any relationship be- 
tween CRISPR-MVLST sequence type and PFGE pat- 
terns, we overlaid our PFGE data to identify isolates 
from different TSTs that have the same PFGE pattern. 
Figure 5a shows that there were seven PFGE pulsotypes 
that could be further separated into TSTs. In the major- 
ity of instances (5/7), identical PFGE patterns were 
found in isolates that had closely related TSTs such as 
JPXX01.0003 and JPXX01.0604 (TSTs 15, 31, 10 and 
TSTs 12 and 21, respectively). 



Table 3 List of all S. Heidelberg CRISPR-MVLST Sequence 
Types (HSTs) that were identified in this study 

HST Frequency Allelic profile 







fimH 


sseL 


CRISPRl 


CRISPR2 


HST 7 


48 


17 


19 


167 


32 


HST 8 


1 


17 


19 


168 


209 


HST 9 


10 


17 


19 


167 


209 


HST 10 


1 


17 


19 


169 


32 


HST 11 


1 


17 


19 


170 


32 


HST 12 


1 


17 


19 


171 


32 


HST 13 


1 


18 


19 


167 


32 


HST 14 


2 


17 


19 


179 


32 


HST 15 


3 


17 


19 


167 


212 


HST 16 


1 


17 


19 


173 


213 


HST 17 


3 


17 


19 


172 


32 


HST 18 


1 


17 


19 


178 


32 


HST 19 


1 


17 


67 


174 


209 


HST 20 


1 


17 


19 


175 


32 


HST 21 


7 


17 


19 


167 


211 


HST 22 


2 


17 


19 


167 


210 


HST 23 


1 


17 


19 


177 


32 


HST 24 


1 


17 


19 


167 


214 


HST 25 


1 


17 


19 


176 


32 


HST 26 


1 


17 


19 


177 


215 


HST 27 


1 


17 


19 


167 


215 



The numbers represent the allelic identifier for the individual CRISPR-MVLST 
markers. The combination of four specific alleles defines a given HST. The 
frequency is the number of times a particular HST was observed among the 
89 S. Heidelberg isolates analyzed. All HSTs identified here were new and not 
seen in previous studies. 
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Table 4 List of all S. Typhiurlum CRISPR-MVLST Sequence 
Types (TSTs) that were identified in this study 

TST Frequency Allelic profile 

fimH sseL CRISPR1 CRISPR2= 



Table 4 List of all S. Typhiurium CRISPR-MVLST Sequence 
Types (TSTs) that were identified in this study (Continued) 



TST 58 ; 


6 


15 


136 


164* 


TST 59 


6 


62 


10 


207* 


TST 50 


6 


15 


166 


208* 



The numbers represent the allelic Identifier for the individual CRISPR-MVLST 
markers. The combination of four specific alleles defines a given HST. The 
frequency is the number of times a particular TST was observed among the 
86 S. Typhimurium isolates analyzed in the first study and does not include 
the frequency of TSTs that were seen in the outbreak study. All TSTs identified 
here were new and not seen in previous studies. ^Some CRISPR2 alleles 
required more than two sequencing primers to cover the whole length of the 
array. Alleles that required three primers are noted with * and the two isolates 
that required seven primers to sequence CRISPR2 are noted with **. The position 
of these primers is shown in Additional file 1. 

Following this, we then generated a dendrogram using 
the Dice coefficient to determine the relationship be- 
tween different PFGE pulsotypes. For clarity, we color- 
coded the PFGE patterns according to the BURST 
Group shown in Figure 5a. As can be seen in Figure 5b, 
closely related CRISPR-MVLST sequence types have 
similar PFGE patterns. 

CRISPR-MVLST analysis of S. Typhimurium outbreak 
Isolates 

Since CRISPR-MVLST and PFGE exhibit a similarly high 
discriminatory ability in S. Typhimurium, we wanted to 
investigate the utility of the former for separating out- 
break isolates. We obtained 30 S. Typhimurium isolates 
from the Pennsylvania Department of Health (Table 5). 
Ten of these were isolates associated with an outbreak 
in 2004 with the cluster designation 0411PAJPX-lc. All 
affected persons were on a bus trip together, though the 
outbreak source was never identified. The remaining 20 
isolates comprised 10 isolates that were linked to a 2009 
live poultry outbreak (cluster 0905PAJPX-1) and 10 con- 
trol isolates that were isolated in the same year but were 
not part of any classified outbreaks. 

CRISPR-MVLST was able to separate the 2004 isolates, 
with each isolate bearing the unique TST59 (Tables 4 and 
5). These isolates were also analyzed by two-enzyme 
PFGE, using Xbal and Blnl. Though they had the same 
TST, two of the isolates, 04E02241 and 04E02239 had dif- 
ferent PFGE patterns with Blnl or Xbal, respectively, and 
are indicated in bold in Table 5. This example shows that 
CRISPR-MVLST provides an epidemiologic concordance 
of 1 (E = 1.0) and for PFGE it is less than 1 (E < 1.0). Add- 
itionally, the Xbal PFGE pattern associated with this 
strain, JPXX01.0146, occurred fairly frequently in our ini- 
tial data set; 12/86 isolates had this pulsotype and we were 
able to separate these into seven different TSTs. 

For the 2009 outbreal< isolates, CRISPR-MVLST cor- 
rectly identified the 10 outbreak isolates (TST42) and 
these all have the same PFGE pattern, JPXX01.0302, thus 



TST 9 5 6 

TST 10 16 8 

TST 1 1 2 6 

TST 1 2 7 6 

TST 1 3 6 6 

TST 14 / 6 

TST 15 4 8 

TST 16 ; 8 

TST 1 7 6 6 

TST 18 ; 8 

TST 1 9 6 6 

TST 20 5 49 

TST 21 / 6 

TST 22 / 6 

TST 23 / 8 

TST 24 ; 6 

TST 25 / 50 

TST 26 ; 6 

TST 27 ; 6 

TST 28 ; 6 

TST 29 ; 8 

TST 30 ; 6 

TST 31 / 8 

TST 32 J 6 

TST 33 1 6 

TST 34 J 8 

TST 35 ; 6 

TST 36 J 8 

TST 37 1 6 

TST 38 J 6 

TST 39 / 6 

TST 40 / 6 

TST 41 / 8 

TST 42 2 6 

TST 56 ; 6 

TST 57 ; 6 



15 


129 


159* 


15 


11 


160 


15 


10 


163* 


15 


10 


164* 


15 


129 


162 


15 


129 


165 


15 


11 


161 


61 
15 


11 

10 


160 

167* 


20 


131 


160 


62 


10 


164* 


15 


129 


162 


15 


132 


164* 


15 


10 


168* 


20 


11 


160 


15 


133 


167* 


20 


134 


169* 


15 


10 


170* 


15 


10 


171* 


15 


10 


172* 


62 


11 


160 


15 


137 


174 


15 


11 


175 


15 


135 


162 


15 


138 


177* 


15 


139 


161 


15 


140 


178* 


63 


11 


160 


15 


141 


162 


15 


10 


179* 


15 


10 


180* 


15 


142 


173* 


20 


143 


166 


15 


10 


181** 


15 


130 


173* 


15 


10 


205** 
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CRISPR-MVLST 
(21) 
D = 0.6931 



PFGE 
(27) 
D = 0.8149 



HST 21 - 
.0058 




Combined 

(44) 
D = 0.9213 

Figure 3 Frequency of S. Heidelberg subtype prevalence generated by CRISPR-MVLST and PFGE. Pie charts showing the number and 
frequency of distinct subtypes defined by a) CRISPR-MVLST, b) PFGE and c) the combination of CRISPR-IVIVLST and PFGE among 89 S. Heidelberg 
isolates. The most frequent subtypes for each method are indicated; .0022 and .0058 represent PFGE profiles JF6X01.0022 and JFeXOI.0058, 
respectively. The number of distinct subtypes defined by each method is listed in parenthesis and the discriminatory power (D) is listed below, 
d) CRISPR-IVIVLST is able to separate the most common S. Heidelberg PFGE pattern JF6X01.0022 into 7 distinct sequence types. 
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TST 




TST 12 




TST 13 



CRISPR-MVLST 
(37) 
D = 0.9415 



PFGE 

(45) 
D = 0.9486 



Figure 4 Frequency of S. Typhimurium subtype prevalence generated by CRISPR-MVLST and PFGE. Pie charts showing the number of 
distinct subtypes defined by a) CRISPR-MVLST and b) PFGE among 86 S. Typhimurium isolates. The most frequent TSTs or PFGE patterns observed 
are indicated. .0003 and .0146 represent PFGE profiles JPXX01.0003 and JPXX01.0146, respectively. The number of distinct subt/pes defined by 
each method is listed in parenthesis and the discriminatory power (D) is listed below. 



for both subtyping methods E = 1.0. Two of the sporadic 
case control isolates were also TST42 (shown in bold in 
Table 5) but these had different PFGE pnlsotypes from the 
outbreak strain, suggesting a lack of discrimination by 
CRISPR-MVLST in this instance. TST42 was seen in two 
isolates in the initial study of 86 S. Typhimurium isolates. 
All isolates within each outbreak were identified using 
CRISPR-MVLST, thus obtaining perfect epidemiological 
concordance with this subtyping method. 

Discussion 

Foodborne illness caused by Salmonella enterica species, 
particularly by S. Typhimurium and S. Heidelberg, ac- 
counts for 18.5% of salmonellosis annually in the United 
States [4]. For accurate outbreak tracking and routine 
disease surveillance, it is critical that we employ rapid, 
efficient and robust subtyping methodologies. PFGE is 
the current gold standard for molecular subtyping of 
Salmonella and other methods include AFLP, MVLA 
and CRISPR-MVLST. 

CRISPR sequence analysis is one of the cheaper and 
faster methods for Salmonella subtyping [22]. For the 
majority of isolates analyzed, CRISPR-MVLST could be 
completed in less than 24 hours, including DNA isola- 
tion and analysis. Additionally, by virtue of their nature, 
sequencing data are more robust and tractable; this 
type of data is unequivocal and, with regards to inter- 
laboratory or database use, is highly consistent. They 
also provide increased downstream utilities that involve 
analysis of sequence information, such as phylogenetic 
studies. This approach is also in line with other high- 
throughput subtyping approaches, including real-time 
CRISPR analysis [32] and whole genome sequence 



analysis [43-47]. Conversely, although protocols exist that 
allow PFGE to be completed in 24 hours, it can often tal<e 
1-3 days, requires skilled personnel, inter-laboratory data 
analysis can be challenging and the data have no utility be- 
yond subtyping. Given the advancement of whole-genome 
sequencing technologies, typing methods based on these 
are in development [48] . While highly discriminatory, lim- 
itations to this approach that are not issues with either 
CRISPR-MVLST or PFGE include the time required for 
analysis and space required for data storage. 

CRISPR spacer analysis alone has been used to 
analyze several different Salmonella serovars [32]. Fabre 
and colleagues showed that among 50 isolates of S. 
Typhimurium and its 1,4, [5],12:i- variant, combined 
CRISPRl and CRISPR2 sequence information is com- 
parable to PFGE (D = 0.88 and 0.87, respectively). Both 
methods were more discriminatory than phage typing 
analysis of the same set of isolates. The same study 
also analyzed spacer content of S. Typhimurium and S. 
Enteritidis from 10 outbreaks and in all cases CRISPR 
sequences exhibited high epidemiologic concordance. 

A preliminary investigation showed that addition of 
CRISPR spacer analysis to an MVLST scheme improves 
discrimination, beyond that provided by either approach 
independently, in eight out of nine of the most common 
illness-causing Salmonella serovars [33]. We wanted to 
extend our evaluation of CRISPR-MVLST utility among 
predominant and clinically relevant Salmonella serovars. 
To date we have tested and compared CRISPR-MVLST 
to PFGE on large numbers of S. Enteritidis [34], S. Newport 
[41] S, Heidelberg and S. Typhimurium isolates. Among 
the total 175 isolates analyzed here, we found significandy 
fewer alleles of fimH and sseL, compared to alleles of 
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Group 1 




JPXX01.055? 
JPXX01.0083 
JPXX01 .0925 
JPXX01.1975 

'JPXX01.064? 
JPXX01.1393 
JPXX01.0206 
JPXX01.0014 
JPXX01.1212 
JPXX01.0001 
JPXX01 .0081 , 
JPXX01.2016 
JPXX01.2731 

Figure 5 Correlation of CRISPR-IVIVLST and PFGE. a) BURST analysis of 37 TSTs identified in this study shows the relationship between 
different TSTs. Within a BURST group, the TSTs within one ring differ from TSTs in an adjacent ring at one of the four CRISPR-MVLST loci. TSTs that 
could not be assigned to a group are listed as singletons. Individual PFGE patterns that are found in isolates that have different TSTs are shown in 
color and the PFGE pulsotype is indicated as the numbers after JPXX01, i.e. JPXX01.0504 is shown as .0504. b) Dendrogram showing the levels of 
similarity between the 45 different PFGE patterns identified. All the PFGE patterns that are found in isolates with TSTs in Groups 1-3 are shaded 
in the corresponding color. The blue asterix represents TST 20, which is in Group 1. 




JPXX01 .017^ 

JPXX01.1314 

|IPXX01.04ia 
pPXX01 .13oj 

JPXX01.0359 
|jPXX01.002^ 
J PXX0 1.0333 



JPXX01 .0442 
JPXX01.0604 
JPXX01.1396 
JPXX01.0781 
JPXX01.0146 
JPXX01 .2900 
JPXX01.1833 
JPXX01.1301 
JPXX01 .0351 




J PXX0 1.0928 
JPXX01.0060 
JPXX01.0070 
JPXX01.0256 
JPXX01.0003 
JPXX01.1011 
JPXX01.1255 
JPXX01.0018 
JPXX01.0098 
JPXX01.1678 
J PXX0 1.0552 

jpxxoi.oml 

JPXX01.0013 
JPXX01.2488 
JPXX01 .0324 



either CRISPR locus (Table 2; Figure 2). Given the reduced 
contribution of the virulence genes to defining STs, their 
addition may seem superfluous within this subtyping 
scheme. However, in this data set, fimH alleles define two 
STs, HST13 and TST20 and sseL alleles define five STs, 
TST16, TST19, TST23, TST29 and TST36. This further 



supports earlier findings showing that addition of MVLST 
to a CRISPR-based subtyping scheme increases discrimin- 
ation in S. Enteritidis [34] as well as among a broad set of 
Salmonella enterica serovars [33]. 

Though the number of isolates for each serovar was simi- 
lar, the number of STs within each serovar is surprisingly 
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Table 5 List of 30 S. Typhimurium isolates used in the outbreak study 



Isolate 


Sequence type 


PFGE-pattern (Xboi) 


PFGE pattern (6/nl) 


Outbreak cluster 


04E02240 


TST59 


JPXX01.0146 


JPXA26.0172 


0411PAJPX-1C 


04E02241 


TST59 


JPXX01.0146 


JPXA26.0294 


0411PAJPX-1C 


04E02243 


TST59 


JPXX01.0146 


JPXA26.0172 


041 1 PAJPX-1 c 


04E02295 


TST59 


JPXX01.0146 


JPXA26.0172 


041 1 PAJPX-1 c 


04E02296 


TST59 


JPXX01.0146 


JPXA26.0172 


0411 PAJPX-1 c 


04E02297 


TST59 


JPXX01.0146 


JPXA26.0172 


0411 PAJPX-1 c 


04 F00368 


TST59 


JPXX01.0146 


JPXA26.0172 


0411 PAJPX-1 c 


04 F00376 


TST59 


JPXX01.0146 


JPXA26.0172 


041 1 PAJPX-1 c 


04 F00381 


TST59 


JPXX01.0146 


JPXA26.0172 


041 1 PAJPX-1 c 


04E02239 


TST59 


JPXX01.0279 


JPXA26.0172 


0411 PAJPX-1 c 


09E00857 


TST42 


JPXX01.0302 


JPXA26.0183 


0905PAJPX-1 


09E01235 


TST42 


J PXX0 1.0302 


JPXA26.0183 


0905PAJPX-1 


09E01308 


TST42 


J PXX0 1.0302 


JPXA26.0183 


0905PAJPX-1 


09E01333 


TST42 


J PXX0 1.0302 


JPXA26.0183 


0905PAJPX-1 


09E01424 


TST42 


JPXX01.0302 


JPXA26.0183 


0905PAJPX-1 


09E01666 


TST42 


JPXX01.0302 


JPXA26.0183 


0905PAJPX-1 


M0901 5209001 A 


TST42 


J PXX0 1.0302 


JPXA26.0183 


0905PAJPX-1 


M09017319001A 


TST42 


J PXX0 1.0302 


JPXA26.0183 


0905PAJPX-1 


M0901 9457001 A 


TST42 


JPXX01.0302 


JPXA26.0183 


0905PAJPX-1 


M09021164001A 


TST42 


JPXX01.0302 


JPXA26.0183 


0905PAJPX-1 


M0901 5294001 A 


TST42 


J PXX0 1.0047 






M0901 9934001 A 


TST42 


JPXX01.0781 






IVIUyU 1 J / zdUU I A 


TCT 1 T 
1 J) 1 \ Z 


JrAAU 1 .UdU4 


iDV A mm 
Jr AAzo.Uzyz 




M0901 9606001 A 


1ST 12 


J PXX0 1.0604 


JPXA26.0174 




M09016911001A 


TST 12 


JPXX01.1214 






09E00951 


TST13 


JPXX01.0001 


JPXA26.0530 




M09019186001A 


TST 13 


J PXX0 1.0946 






09E01471 


TST 15 


J PXX0 1.2095 






M0901 6893001 A 


TST 19 


JPXX01.0146 


JPXA26.0291 




M090 17200001 A 


TST 60 


JPXX01.0359 







The 10 isolates witiiout cluster information represent the sporadic, or non-outbreal( related, isolates used as controls in the study. 



disparate: among 89 S. Heidelberg isolates we identified 21 
HSTs and in 86 S. Typhimurium isolates, we identified 37 
TSTs. This presumably reflects varied levels of clonality in 
different serovars. Independently of the number of STs de- 
fined for either serovar, the CRISPR loci are responsible for 
the vast majority of alleles: (S. Heidelberg - 83.3% and 
S. Typhimurium - 80%) (Figure 2). In 5. Heidelberg, 50% 
of the different alleles identified were CRISPRl alleles. 
Given that CRISPRs are of one of the more dynamic loci in 
bacteria [30,31], this finding is not unexpected. 

Although PFGE was more discriminatory than CRISPR- 
MVLST among 89 S. Heidelberg isolates (D = 0.81 versus 
0.69, respectively), a combination of both techniques pro- 
vided an improved value of 0.92. This represents a 92% 
probability that two unrelated strains can be separated. 



JF6X01.0022 is the most common PFGE pattern in Pulse- 
Net for S. Heidelberg [49] and is seen 30-40 times a 
month by the CDC. In our data set, 42% of the isolates 
have the JF6X01.0022 pattern and using CRISPR-MVLST, 
we were able to further separate these into seven distinct 
CRISPR-MVLST types (Figure 3b and d). Given the fre- 
quency at which this PFGE pattern occurs nationally, not 
all isolates that have this pattern may be associated with a 
specific outbreak, further enhancing the utility of CRISPR- 
MVLST as a complement to PFGE analysis. Collectively, 
these findings in S. Heidelberg show that the JF6X01.0022 
pattern is analogous to the JEGX01.0004 pattern in S. 
Enteritidis, where the latter was observed in 51% of 
isolates analyzed and was separated into 12 distinct 
STs [34]. A proposed improvement for discrimination in 
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S. Heidelberg and S. Enteritidis by PFGE is to increase the 
number of enzymes used for PFGE analysis [50,51], 
though the concurrent use of PFGE and CRISPR-MVLST 
would be much more efficient than this approach. 

Regarding S. Heidelberg, our data are similar to that 
observed in a broad set of S. Enteritidis isolates [34]: 
both serovars exhibit fewer number of STs identified and 
both require combining CRISPR-MVLST and PFGE to 
obtain a sufficient discriminatory power. This presum- 
ably reflects similar levels of clonality in S. Heidelberg 
and S. Enteritidis as compared to more heterogenous 
serovars such as 5. Typhimurium where we observed 
many more STs present within a similar number of iso- 
lates examined. 

Our data show that in S. Typhimurium, the discrimin- 
ation provided by either PFGE or CRISPR-MVLST is 
similar (0.9486 versus 0.9415, respectively). When 
CRISPR-MVLST was applied to outbreak isolates, we 
were able to correctly identify the 20 isolates represent- 
ing the two outbreaks, showing an extremely good 
epidemiologic concordance with this typing method. 
The epidemiologic concordance was better by CRISPR- 
MVLST than PFGE in identifying isolates from the 2004 
bus trip outbreak and both methods had equal epi- 
demiological concordance for the 2009 live poultry out- 
break. Regarding the 2004 outbreak, the majority of 
isolates had the JPXX01.0146 pulsotype. In our initial 
study, this pulsotype was seen frequently, 16% of all iso- 
lates analyzed, and the 14 isolates with this pattern could 
also be represented by 7 distinct TSTs. Conversely, all 
isolates from this outbreak have TST59, which is unique 
and not seen in our initial data set showing that in this 
instance, CRISPR-MVLST may be a better subtyping ap- 
proach. In analyzing the 2009 live poultry outbreak, it 
appears that PFGE is more discriminatory than CRISPR- 
MVLST, as CRISPR-MVLST also identified two non- 
outbreak related isolates as TST42. Given the available 
epidemiological data available, these two isolates do not 
appear to be associated with the outbreak. The fact that 
CRISPR-MVLST works better in some instances than 
others is not surprising and can also occur when other 
subtyping methods are used. 'Problematic' PFGE pulso- 
types also exist and is one reason that second generation 
methods like MLVA and CRISPR-MVLST are being de- 
veloped [33,52]. As a recent example, isolates associated 
with the 2012 S. Typhimurium cantaloupe outbreak, had 
a common PFGE pattern so additional subtyping by 
MLVA was performed to correctly define the outbreak 
strain [24]. That there is a strong association among 
closely related sequence types and closely related PFGE 
patterns for both S. Typhimurium (Figure 5) and S. 
Newport [41] provides further evidence that CRISPR- 
MVLST could serve as an appropriate alternative sub- 
typing method. 



Beyond the data shown here and in further evaluating 
the value of CRISPR-MVLST sequence typing, a recent 
study investigating S. Typhimurium isolates from a var- 
iety of animal sources showed an association of CRISPR- 
MVLST sequence types and resistance to antibiotics 
[40]. As part of that study, the most frequent TSTs were 
TSTIO and TST42, both of which were found in this 
current study. TSTIO was also the most frequent clinical 
sequence type seen in this study (16/86 isolates) but only 
two isolates were TST42. 

Conclusion 

CRISPR-MVLST is a relatively new subtyping approach 
with limited studies conducted in Salmonella that dem- 
onstrate its utility [33,34,39]. Our data here add to this 
body of work by demonstrating its functionality in two 
highly prevalent clinical serovars. Investigation of several 
more outbreak strains using CRISPR-MVLST will eluci- 
date the true capability of this subtyping method. Our 
data here show that CRISPR-MVLST can be used in 
concert with PFGE, as in the case of S. Heidelberg, or 
potentially as an independent subtyping method, as in 
the case of S. Typhimurium. 

Methods 

Bacterial isolates and sample preparation 

A summary of all isolates analyzed in this study is listed 
in Table 5. A total of 89 and 86 clinical isolates of S. 
Heidelberg and S. Typhimurium, respectively, were 
obtained from the Pennsylvania Department of Health. 
These isolates were selected systematically (isolates re- 
ceived closest to the 1**' and 15"^ of each month from 
2005 - 2011 were selected) to represent an unbiased 
collection of human clinical isolates. VYGE-Xbal analysis 
of these isolates was conducted using standard protocols 
[7,53]. All isolates were stored at -80°C in 20% glycerol. 
Isolates were grown overnight in 2 mL LB at 37°C in a 
shaking incubator. DNA was isolated using the Promega 
genomic DNA isolation kit, following the manufacturer's 
directions (Promega, Madison, WI). DNA samples were 
stored at -20°C prior to PCR analysis. 

PCR amplification 

Primers for amplification of all four genomic loci are listed 
in Table 6. PCR reactions were performed in a total vol- 
ume of 25 |il: 1.5 |il template, 0.3 \A Taq (1.5 units; New 
England Bio Labs, Ipswich, MA), 0.2 ^il 10 mM dNTPs, 
1 |il of each 10 |iM primer, 2.5 |il of lOx Taq buffer and 
18.5 |il water. PCR conditions were as follows and the 
annealing temperatures (AT) are listed in Table 6: initial 
denaturation step of 10 minutes at 94°C followed by 35 cy- 
cles of 1 minute at 94°C, 1 minute at AT and extension for 
1 minute (fimH and sseL) or 1.5 minutes (CRISPRl and 
CRISPR2) at 72°C; a final extension step was done at 72°C 
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Table 6 List of primers used in this study for PCR amplification and sequencing of the four CRISPR-MVLST markers 



Primer 


Orientation 


Primer sequence {5-3') 


Annealing temp. 


PCR 


Sequencing 


CRISPRl-5 


Forward 


TGAAAACAGACGTATTCCGGTAGATT 


55.5 


/ 




CRISPRl-1 


Reverse 


CAGCATATTGACAAGGCGCT 




✓ 


J 


CRISPR2-3 


Forward 


ArrGTTGCGATTATGrrGGT 


57 


✓ 




CRISPR2-1 


Reverse 


TCCAGCTCCCTTATGATTTT 








CRISPR2-4 


Reverse 


GCAATACCCTGATCCTTAACGCCA 








CRISPR2-5 


Reverse 


CGACGAAATTAAAACCGAACT 






✓ 


CRISPR2-6 


Forward 


CGGATTCCATGCGI 1 1 ICA 






✓ 


CRISPR2-7 


Forward 


CCGGCGAGGTCAATAAAA 








CRISPR2-8 


Forward 


TGACGCTGGTCTATACCG 






/ 




Forward 


f — rr A rf — rr a r — rrrrr a a 






/ 


CRISPR2-10 


Reverse 


CTCTTCGCACTCrCGATCAA 






J 


fimH-1 


Forward 


AGGTGAACTGTTCATCCAGTGG 


56.7 




J 


fimH-2 


Reverse 


GCGGGaGAACAAAACACAA 




✓ 




sseL-1 


Forward 


AAAATCAGGTCTATGCCrGATTTAATATATC 


60 


✓ 




sseL-2 


Reverse 


GGCTCTAAGTACTCACCATTACT 








sseL-3 


Forward 


ACCAGGAAACAGAGCAAAATGAATATATGT 








sseL-4 


Forward 


TTCTCTCGGTAAACTATCCTATTGGGC 









for 8 minutes. 5 [^1 of each PCR product was electrophor- 
etically analyzed on a 1.2% agarose gel and the remaining 
reaction stored at -20°C. 

DNA sequencing 

PCR products were treated with 10 units of Exonuclease 
(New England Bio Labs, Ipswich, MA) and 1 unit of 
Antarctic alkaline phosphatase (New England Bio Labs, 
Ipswich, MA). The mixture was incubated for 40 minutes 
at 37°C to remove remaining primers and unincorporated 
dNTPs. The enzymes were inactivated by incubating the 
samples at 85°C for 15 minutes. Purified PCR products 
were sequenced at the Huck Institute's Nucleic Acid 
Facility at The Pennsylvania State University using 3' 
BigDye-labeled dideoxynucleotide triphosphates (v 3.1 dye 
terminators; LifeTechnoloties, Carlsbad, CA) and run on 
an ABI 3730XL DNA Analyzer, using ABI Data Collection 
Program (v 2.0). Data was analyzed with ABI Sequencing 
Analysis software (Version 5.1.1). The primers used for se- 
quencing are listed in Table 6. In total, four PCR reactions 
and eight sequencing reactions were conducted for each 
isolate being typed. Additionally, one internal sequencing 
reaction was required for 14/26 S. Typhimurium CRISPR2 
alleles, due to the increased length of this locus. There 
were two alleles (only representing 2/86 S. Typhimurium 
isolates), 181 and 205, which required extra primers due 
to the presence of a duplicated region of the locus. The 
positions of these extra primers are shown in Additional 
file 1: Figure SI. CRISPR2 alleles that were sequenced 
using more than two primers are indicated in Table 3. 



Sequence analysis and sequence type assignment 

Sequences were assembled and aligned using SeqMan 
and MegAlign, respectively (Lasergene 10, DNA Star, 
Madison, WI) and unique alleles were assigned a unique 
numerical designation. All sequences from this study 
were submitted as a batch to NCBI and the accession 
numbers (KF465853 - KF465929) are shown for each 
allele in Additional file 2. For each isolate the combin- 
ation of allelic types at all four loci defines the serovar- 
designated sequence type (ST) (Tables 2 and 3), with 
each unique allelic type assigned a different ST number. 
The presence of a SNP in any marker was sufficient to 
define a new allele. Analysis of CRISPRl and CRISPR2 
was performed using CRISPR-finder (http://crispr.u- 
psud.fr/Server/). We did not identify any SNPs within 
either CRISPR locus that defined any allele. Allelic dif- 
ferences occurred from deletion of one or more spacers, 
addition of a spacer or duplication/triplication of a 
spacer. Discriminatory power was calculated using the 
method described by Hunter and Gaston [54], with 
strains defined as either unique STs or unique PFGE 
patterns. 

Relationships between TSTs were calculated using 
BURST (www.pubmlst.org/analysis/), with a group defin- 
ition of n-1. Unique PFGE patterns, or pulsotypes, were 
defined by PulseNet, using the Dice coefficient with an 
optimization of 1.5% and a position tolerance of 1.5%. The 
difference of one band is sufficient to call two PFGE pat- 
terns different. PFGE dendrograms were generated using 
BioNumerics v. 6.6. 
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S. Typhimurium outbreak study 

A summary of 305. Typhimurium outbreak isolates that 
were obtained from the Pennsylvania Department of 
Health is listed in Table 4. Ten of these isolates associated 
with an outbreak in 2004 (cluster 0411PAJPX-lc) where 
affected patients had been on a bus trip together, though 
no vector was ever identified. Another 10 isolates were 
linked to an outbreak in 2009 (cluster 0905PAJPX-1), 
which was associated with live poultry. The remaining 10 
isolates represent sporadic case isolates, also from 2009 
but were not associated with the 0905PAJPX-1 outbreak 
and thus served as controls. The isolates were cultured as 
described above. 

Consent and institutional review board (IRB) approval 

This study design was reviewed by the Pennsylvania De- 
partment of Health IRB and was determined to be ex- 
empt under federal regulations as it falls within the 
category "research that involves the collection or study 
of existing data, documents, records, pathological speci- 
mens, or diagnostic specimens where the information is 
recorded by the investigator in such a manner that sub- 
jects cannot be identified, directly or through identifiers 
linked to the subjects". 
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